Document Layout Analysis


Document layout analysis (DLA) is the process of analyzing the spatial arrangement of a document's content to determine its structure. This includes locating text, tables, images, and other elements, as well as recovering the logical hierarchy, such as headings and subheadings. DLA supports information extraction and categorization and the automation of document processing workflows.
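
As a rough illustration of what a DLA result can look like once elements have been located, the sketch below models each detected element as a labeled bounding box, groups elements by type, and applies a naive reading order. The `Region` schema, the label names, the coordinates, and both helper functions are hypothetical and chosen only for this example; they are not taken from any of the papers or tools listed below.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """One detected layout element (hypothetical schema for illustration)."""
    label: str   # e.g. "heading", "text", "table", "image"
    x0: float    # left edge
    y0: float    # top edge (y grows downward, as in image coordinates)
    x1: float    # right edge
    y1: float    # bottom edge


def group_by_label(regions):
    """Categorize regions by element type."""
    groups = defaultdict(list)
    for region in regions:
        groups[region.label].append(region)
    return dict(groups)


def naive_reading_order(regions):
    """Sort top-to-bottom, then left-to-right (assumes a single-column page)."""
    return sorted(regions, key=lambda r: (r.y0, r.x0))


# Made-up detections for one page, deliberately out of order.
page = [
    Region("text", 50, 120, 550, 300),
    Region("heading", 50, 40, 550, 90),
    Region("table", 50, 320, 550, 500),
    Region("image", 50, 520, 300, 700),
]

ordered = naive_reading_order(page)
groups = group_by_label(page)
```

The sort key is only a reasonable assumption for single-column pages; multi-column or nested layouts need a more careful ordering strategy.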

Graph-based Document Structure Analysis

Feb 04, 2025
Via arXiv

Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion

Jan 27, 2025
Via arXiv

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

Jan 15, 2025
Via arXiv

S2 Chunking: A Hybrid Framework for Document Segmentation Through Integrated Spatial and Semantic Analysis

Jan 08, 2025
Via arXiv

HAND: Hierarchical Attention Network for Multi-Scale Handwritten Document Recognition and Layout Analysis

Dec 25, 2024
Via arXiv

DoPTA: Improving Document Layout Analysis using Patch-Text Alignment

Dec 17, 2024
Via arXiv

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

Dec 24, 2024
Via arXiv

Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models

Dec 18, 2024
Via arXiv

SAIL: Sample-Centric In-Context Learning for Document Information Extraction

Dec 22, 2024
Via arXiv

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Dec 10, 2024
Via arXiv